How Many People Visit YouTube? Imputing Missing Events in Panels With Excess Zeros
نویسندگان
چکیده
Media-metering panels track TV and online usage of people to analyze viewing behavior. However, panel data is often incomplete due to nonregistered devices, non-compliant panelists, or work usage. We thus propose a probabilistic model to impute missing events in data with excess zeros using a negative-binomial hurdle model for the unobserved events and beta-binomial sub-sampling to account for missingness. We then use the presented models to estimate the number of people in Germany who visit YouTube.
منابع مشابه
How Many Millennials Visit YouTube? Estimating Unobserved Events From Incomplete Panel Data Conditioned on Demographic Covariates
Many socio-economic studies rely on panel data as they also provide detailed demographic information about consumers. For example, advertisers use TV and web metering panels to estimate ads effectiveness in selected target demographics. However, panels often record only a fraction of all events due to non-registered devices, technical problems, or work usage. Goerg et al. (4) present a beta-bin...
متن کاملCross Panel Imputation
Many empirical microeconomics studies rely on consumer panels. For example, TV and web metering panels track TV and online usage of individuals to estimate reach and frequency of a campaign: reach is the fraction of the population that has been exposed to an ads and frequency measures how often they have seen it on average. As reach and frequency are used in media planning, i.e., optimal mix be...
متن کاملData Enrichment and Cross Panel Imputation
Many empirical microeconomics studies rely on consumer panels. For example, TV and web metering panels track TV and online usage of individuals to estimate reach and frequency of a campaign: reach is the fraction of the population that has been exposed to an ads and frequency measures how often they have seen it on average. As reach and frequency are used in media planning, i.e., optimal mix be...
متن کاملGeneral Methods and Algorithms for Modeling and Imputing Discrete Data under a Variety of Constraints
Loglinear modeling methods have become quite straightforward to apply to discrete data X. The models for missing data involve minor extensions of hot-deck methods (Little and Rubin 2002). Edits are structural zeros that forbid certain patterns. Winkler (2003) provided the theory for connecting edit with imputation. In this paper, we give methods and algorithms for modeling/edit/imputation under...
متن کاملInferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows
MOTIVATION Typical high-throughput genotyping techniques produce numerous missing calls that confound subsequent analyses, such as disease association studies. Common remedies for this problem include removing affected markers and/or samples or, otherwise, imputing the missing data. On small marker sets imputation is frequently based on a vote of the K-nearest-neighbor (KNN) haplotypes, but thi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015